<html><head><title>Data Operations Linguist (Phonetics) - Menlo Park, CA</title></head>
<body><h2>Data Operations Linguist (Phonetics) - Menlo Park, CA</h2>
<div>As a Data Operations Linguist in Phonetics, you will transcribe, annotate, and curate speech data sets to drive improvements to our ASR and TTS products. You will focus on creating high quality speech data through transcription, guideline authoring and management, annotator training, and working with cross-functional teams to coordinate quality and timeliness. You will use your analytical mindset, strong communication skills, collaborative attitude, and personal drive to achieve team goals. We value expertise in any subfield of phonetics (articulatory, acoustic, neurolinguistics, etc.), expect a passion for the scientific study of language, and require research experience that includes a thorough understanding of the scientific method and experimental design. We are seeking new colleagues who work well on teams, are open to different viewpoints, and can quickly agree on the most optimal solutions to speech data problems.
</div><div><div>RESPONSIBILITIES</div><div></div><div><div><ul><li><div><div>
Transcribe and annotate speech data for large-scale ASR and TTS projects.</div></div></li></ul><div></div><ul><li><div><div>
Select corpus material for voice recordings.</div></div></li></ul><div></div><ul><li><div><div>
Reason about transcription and annotation guidelines and identify issues and flaws.</div></div></li></ul><div></div><ul><li><div><div>
Analyze system metrics such as user opinion, lexicon transcription coverage, and POS tagger performance and identify pain points.</div></div></li></ul><div></div><ul><li><div><div>
Translate data requirements into annotation guidelines and other documentation.</div></div></li></ul><div></div><ul><li><div><div>
Work in concert with other linguists, as well as data scientists, engineers, and project managers to achieve project goals.</div></div></li></ul><div></div><ul><li><div><div>
Manage data creation and annotation efforts with third-party partners.</div></div></li></ul><div></div><ul><li><div><div>
Use data manipulation tools and scripts to move quickly and accurately at scale.</div></div></li></ul><div></div><ul><li><div><div>
Continually develop an understanding of the relationship between data and machine learning models.</div></div></li></ul><div></div><ul><li><div><div>
Guide collaboration with cross-functional teams in shared work, existing initiatives, or new initiatives.</div></div></li></ul></div></div></div><div><div>
MINIMUM QUALIFICATIONS</div><div></div><div><div><ul><li><div><div>
Degree in Linguistics, Language Technologies, Computational Linguistics, Speech Science, related field, or equivalent industry experience.</div></div></li></ul><div></div><ul><li><div><div>
Experience with phonetics and other various areas of linguistics such as phonology, sociolinguistics, dialectology, computational linguistics, and field work.</div></div></li></ul><div></div><ul><li><div><div>
Experience using the command line, GUI, and database tools.</div></div></li></ul><div></div><ul><li><div><div>
Experience with waveforms and spectrograms.</div></div></li></ul><div></div><ul><li><div><div>
Experience in transcription and annotation systems such as SAMPA, IPA, and ToBI.</div></div></li></ul><div></div><ul><li><div><div>
Experience working on or leading cross-functional efforts, initiatives, or projects.</div></div></li></ul><div></div><ul><li><div><div>
Experience with prioritizing multiple work streams and conduct day-to-day tasks without oversight.</div></div></li></ul><div></div><ul><li><div><div>
Experience with forming internal team relationships and foster external relations.</div></div></li></ul><div></div><ul><li><div><div>
1+ years experience with linguistic data annotation, quality, and metrics.</div></div></li></ul></div></div></div><div><div>
PREFERRED QUALIFICATIONS</div><div></div><div><div><ul><li><div><div>
Advanced degree in Linguistics, Language Technologies, Computational Linguistics, Speech Science, or related field.</div></div></li></ul><div></div><ul><li><div><div>
Experience defining quality expectations for annotation guidelines and data.</div></div></li></ul><div></div><ul><li><div><div>
Experience developing and evaluating data annotation metrics.</div></div></li></ul><div></div><ul><li><div><div>
Experience with basic procedural scripting.</div></div></li></ul><div></div><ul><li><div><div>
Experience using metrics to identify high-level data solutions, communicate them to cross-functional partners, and implement them.</div></div></li></ul><div></div><ul><li><div><div>
Experience improving high-level data management processes and workflows.</div></div></li></ul><div></div><ul><li><div><div>
Experience with project scaling and language expansion.</div></div></li></ul><div></div><ul><li><div><div>
2+ years relevant industry experience.</div></div></li></ul></div></div></div><div><div>
Facebook is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, you may contact us at accommodations-ext@fb.com.</div></div></body>
</html>